Multi-bit-per-cell three-dimensional one-time-programmable memory

ABSTRACT

A multi-bit-per-cell three-dimensional read-only memory (3D-OTPMB) comprises a plurality of OTP cells stacked above a semiconductor substrate. Each OTP cell comprises an antifuse layer, which is irreversibly switched from a high-resistance state to a low-resistance state during programming. By adjusting the programming current, the programmed antifuses have different resistances.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims priority from Chinese Patent Application 201610238012.7, filed on Apr. 14, 2016, in the State Intellectual Property Office of the People's Republic of China (CN), the disclosure of which is incorporated herein by reference in its entirety.

BACKGROUND 1. Technical Field of the Invention

The present invention relates to the field of integrated circuit, and more particularly to one-time-programmable memory (OTP).

2. Prior Art

Three-dimensional one-time-programmable memory (3D-OTP) is a monolithic semiconductor memory. It comprises a plurality of vertically stacked OTP cells. In a conventional OTP, the memory cells are formed on a two-dimensional (2-D) plane (i.e. on a semiconductor substrate). In contrast, the memory cells of the 3D-OTP are formed in a three-dimensional (3-D) space. The 3D-OTP has a large storage density and a low storage cost. Because the 3D-OTP has a long data retention (>100 years), it is suitable for long-term data storage.

U.S. Pat. No. 5,835,396 issued to Zhang on Nov. 10, 1998 discloses a 3D-OTP 00. It comprises a semiconductor substrate 0 and a plurality of OTP memory levels 100, 200 stacked above the semiconductor substrate 0. Among them, the memory level 200 is stacked above the memory level 100. Transistors in the substrate 0 and interconnects thereof form a substrate circuit (including the peripheral circuit of the OTP memory levels 100, 200). Each OTP memory level (e.g. 100) comprises a plurality of address lines (e.g. word lines 20 a, 20 b . . . , and bit lines 30 a, 30 b . . . ) and memory cells (e.g. 1 aa-1 bb . . . ). Each OTP memory level 100 further comprises a plurality of OTP arrays. Each OTP array is a collection of all OTP cells which share at least one address line. Contact vias 20 av, 30 av couple the address lines 20 a, 30 a with the substrate 0.

The 3D-OTP in Zhang is a single-bit-per-cell 3D-OTP, wherein each 3D-OTP cell stores a single bit. Namely, each OTP cell has two states ‘1’ and ‘0’: the ‘1’ OTP cell is in a low-resistance state, whereas the ‘0’ cell is in a high-resistance state. To further improve the storage density and lower the storage cost, it is desired to store more bits in each 3D-OTP cell.

OBJECTS AND ADVANTAGES

It is a principle object of the present invention to provide a 3D-OTP with a large storage capacity.

It is a further object of the present invention to provide a 3D-OTP with a low storage cost.

It is a further object of the present invention to provide a properly working 3D-OTP even with leaky OTP cells.

It is a further object of the present invention to provide a properly working 3D-OTP even under external interferences.

In accordance with these and other objects of the present invention, the present invention discloses a multi-bit-per-cell 3D-OTP.

SUMMARY OF THE INVENTION

The present invention discloses a multi-bit-per-cell three-dimensional one-time-programmable memory (3D-OTP_(MB)). It comprises a plurality of OTP cells stacked above a semiconductor substrate. Each OTP cell comprises an antifuse layer, which is irreversibly switched from a high-resistance state to a low-resistance state during programming. By adjusting the magnitude of the programming currents, the programmed antifuses have different resistances. Using the resistance to represent the digital states, the OTP cells have N (N>2) states: 0, 1, . . . N−1, whose respective resistances are R_(O), R₁, . . . R_(N-1), with R₀>R₁> . . . >R_(N-1). Having N states, each OTP cell stores more than one bit.

To minimize read error due to leaky OTP cells, the present invention discloses a full-read mode. For the full-read mode, the states of all OTP cells on a selected word line are read out during a read cycle. The read cycle includes two read phases: a pre-charge phase and a read-out phase. During the pre-charge phase, all address lines (including all word and all bit lines) in an OTP array are charged to an input bias voltage of an amplifier associated with the OTP array. During the read-out phase, after its voltage is raised to the read voltage V_(R), a selected word line starts to charge all bit lines through the associated OTP cells. By measuring the voltage change on the bit lines, the states of the associated OTP cells can be determined.

To minimize read error due to external interferences, the present invention further discloses a differential amplifier for measuring the states of the OTP cells. One input of the differential amplifier is the bit-line voltage V_(b) from a data OTP cell (i.e. the OTP cell that stores data), while the other input is a reference voltage V_(ref) from a dummy OTP cell (i.e. the OTP cell that provides V_(ref) for the differential amplifier). Like the data OTP cells, the dummy OTP cells have N states. The value of the reference voltage (e.g. V_(ref,1)) is between the voltages (e.g. V_(‘0’), V_(‘1’)) on the bit lines associated with the dummy OTP cells in adjacent states (e.g. ‘0’, ‘1’), preferably equal to the average of the two. To determine the state of a selected data OTP cell, N−1 measurements are taken. The data OTP cell is in the state ‘k’ if V_(ref,k-1)<V_(b)<V_(ref,k) (k=1, 2, . . . N−1).

Accordingly, the present invention discloses a multi-bit-per-cell 3D-OTP (3D-OTP_(MB)), comprising: a semiconductor substrate including transistors thereon; a plurality of OTP cells stacked above said semiconductor substrate, each of said OTP cells comprising an antifuse layer, where said antifuse layer is irreversibly switched from a high-resistance state to a low-resistance state during programming; a plurality of contact vias coupling said OTP cells to said semiconductor substrate; wherein said OTP cells have more than two states, the OTP cell in different states being programmed by different programming currents.

The present invention further discloses a multi-bit-per-cell 3D-OTP (3D-OTP_(MB)), comprising: a semiconductor substrate including transistors thereon; a plurality of OTP cells stacked above said semiconductor substrate each of said OTP cells comprising an antifuse layer, where said antifuse layer is irreversibly switched from a high-resistance state to a low-resistance state during programming; a plurality of contact vias coupling said OTP cells to said semiconductor substrate; wherein the resistance of said antifuse layer is determined by a programming current, said OTP cells being programmed by at least two programming currents.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a perspective view of a 3D-OTP.

FIGS. 2A-2D are cross-sectional views of four preferred 3D-OTP cells in different states.

FIG. 3 is a symbol of an OTP cell.

FIG. 4A shows a current-voltage (I-V) characteristics of four preferred OTP cells in different states; FIG. 4B shows a relationship between the resistance of the programmed antifuse and the programming current (R_(AF)-I_(P)).

FIGS. 5A-5C are cross-sectional views of three preferred 3D-OTP cells;

FIG. 6A is a circuit block diagram of a preferred OTP array in the full-read mode; FIG. 6B is its signal timing diagram.

FIG. 7A is a circuit block diagram of a first preferred OTP array comprising differential amplifiers; FIGS. 7B-7C are the signal timing diagrams for the reference voltages and the bit-line voltages, respectively.

FIG. 8A is a circuit block diagram of a second preferred OTP array comprising a differential amplifier; FIG. 8B is its signal timing diagram.

FIG. 9 is a preferred 3D-OTP comprising a dummy word line.

It should be noted that all the drawings are schematic and not drawn to scale. Relative dimensions and proportions of parts of the device structures in the figures have been shown exaggerated or reduced in size for the sake of clarity and convenience in the drawings. The same reference symbols are generally used to refer to corresponding or similar features in the different embodiments. In FIG. 6A, FIG. 7A, FIG. 8A and FIG. 9, solid dots represent programmed OTP cells, while open dots represent unprogrammed OTP cells. The symbol “/” means a relationship of “and” or “or”.

Throughout the present invention, the phrase “on the substrate” means the active elements of a circuit are formed on the surface of the substrate, although the interconnects between these active elements are formed above the substrate and do not touch the substrate; the phrase “above the substrate” means the active elements are formed above the substrate and do not touch the substrate.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

Those of ordinary skills in the art will realize that the following description of the present invention is illustrative only and is not intended to be in any way limiting. Other embodiments of the invention will readily suggest themselves to such skilled persons from an examination of the within disclosure.

Referring now to FIG. 2A-2D, four preferred OTP cells 1 aa-1 ad are disclosed. They are in the states: ‘0’, ‘1’, ‘2’, ‘3’, respectively. Each OTP cell (e.g. 1 aa) comprises a top electrode 30 a, a bottom electrode 20 a, an antifuse layer 22 and a quasi-conductive layer 24. The antifuse layer 22 has high resistance before programming (FIG. 2A), and is irreversibly switched to low resistance after programming (FIGS. 2B-2D). The quasi-conductive has the following properties: its resistance at the read voltage (i.e. the read resistance) is substantially lower than when the applied voltage has a magnitude smaller than or polarity opposite to that of the read voltage.

Because the OTP cell 1 aa is unprogrammed, no conductive filament is formed in its antifuse layer 22. On the other hand, because the OTP cells 1 ab-1 ad are programmed, conductive filaments 25 x-25 z of different sizes are formed therein. Among them, the conductive filament 25 x of the OTP cell 1 ab is thinnest and has the largest resistance; the conductive filament 25 z of the OTP cell 1 ad is thickest and has the lowest resistance; the conductive filaments 25 y of the OTP cell 1 ac has an intermediate size and therefore, has an intermediate resistance.

FIG. 3 is a symbol of an OTP cell 1. It comprises an antifuse 12 and a diode 14. The antifuse 12 comprises the antifuse layer 22 and its resistance is irreversibly switched from high to low during programming. The diode 14 comprises the quasi-conductive layer 24 and is broadly interpreted as any two-terminal device whose resistance at the read voltage is substantially lower than when the applied voltage has a magnitude smaller than or polarity opposite to that of the read voltage.

Referring now to FIGS. 4A-4B, the electrical characteristics of the ‘0’-‘3’ OTP cells are disclosed. FIG. 4A shows current-voltage (I-V) characteristics of the ‘0’-‘3’ OTP cells. The I-V curve 130 corresponds to a ‘0’ OTP cell 1 aa; the I-V curve 131 corresponds to a ‘1’ OTP cell 1 ab; the I-V curve 132 corresponds to a ‘2’ OTP cell 1 ac; the I-V curve 133 corresponds to a ‘3’ OTP cell 1 ad. The diode 14 has a turn-on voltage V_(on). Once the applied voltage is larger than V_(on), the resistance of the diode 14 drops rapidly. At this time, the resistance of the OTP cell 1 primarily comes from the antifuse layer 12.

FIG. 4B shows a relationship between the resistance of the programmed antifuse (R_(AF)) and the programming current (I_(P)). As R_(AF) is inversely proportional to I_(P), the programmed antifuse would have different R_(AF) by adjusting I_(P). For the state ‘1’, the programming current I_(P1) is relatively small, thus the antifuse resistance R₁ is relatively large. For the state ‘3’, the programming current I_(P3) is relatively large, thus the antifuse resistance R₃ is relatively small. The state ‘2’ is between the states ‘1’ and ‘3’. Overall, I_(p1)<I_(p2)<I_(p3) leads to R₁>R₂>R₃.

Referring now to FIGS. 5A-5C, three preferred 3D-OTP cells 1 aa are shown. In the preferred embodiment of FIG. 5A, the bottom electrode (word line) 20 a comprises a metallic or highly doped semiconductor material. The top electrode (bit line) 30 a comprises a metallic or highly doped semiconductor material. The antifuse layer 22 is a layer of insulating dielectric (e.g. silicon oxide, silicon nitride). The quasi-conductive layer 24 is used to form a diode 14. For a semiconductor diode 14, the bottom electrode 20 a comprises a P+ semiconductor material, the quasi-conductive layer 24 comprises an N− semiconductor material, while the top electrode 30 a comprises an N+ semiconductor material. Alternatively, the bottom electrode 20 a comprises a metallic material, the quasi-conductive layer 24 comprises a P+/N−/N+ diode, while the top electrode 30 a comprises another metallic material. For a Schottky diode 14, the bottom electrode 30 a comprises a metallic material, the quasi-conductive layer 24 comprises an N− semiconductor material, while the top electrode 30 a comprises an N+ semiconductor material. For a ceramic diode 14, the bottom electrode 30 a comprises a metallic material, the quasi-conductive layer 24 comprises a ceramic material (e.g. a layer of metal oxide), while the top electrode 30 a comprises another metallic material.

The preferred embodiment of FIG. 5B is similar to that of FIG. 5A, except that a conductive layer 26 separates the antifuse layer 22 from the quasi-conductive layer 24. The conductive layer 26 preferably comprises at least a metallic material, which minimizes heat damage to the quasi-conductive layer 24 during programming. The preferred embodiment of FIG. 5C is even simpler than those of FIGS. 5A-5B in that it does not comprise a separate quasi-conductive layer. After the antifuse layer 22 is ruptured, a diode is naturally formed at the junction of the top electrode 30 a and the bottom electrode 20 a. As an example, the bottom electrode 20 a comprises a highly doped P+ semiconductor material and the top electrode 20 a comprises a highly doped N+ semiconductor material. For those skilled in the art, the OTP cell 1 aa could take other forms.

To minimize read error due to leaky OTP cells, the present invention discloses a full-read mode. For the full-read mode, all OTP cells on a selected word line are read out during a read cycle T. FIGS. 6A-6B discloses a preferred OTP array 0A in the full-read mode. The OTP array 0A comprises word lines 20 a-20 z, bit lines 30 a-30 z and OTP cells 1 aa-1 zz (the numbers in the parenthesis represent the states of the OTP cells) (FIG. 6A). The preferred embodiment further comprises a single-ended amplifier 58S. The read cycle T includes two read phases: a pre-charge phase t_(pre) and a read-out phase t_(R) (FIG. 6B). During the pre-charge phase t_(pre), all address lines 20 a-20 z, 30 a-30 z in the OTP array 0A are charged to an input bias voltage V_(i), of the amplifier 58S.

During the read-out phase t_(R), all bit lines 30 a-30 z are floating. Based on the row address 52A, the row decoder 52 raises the voltage on a selected word line 20 a to the read voltage V_(R), while voltage on unselected word lines 20 b-20 z remains at the input bias voltage V_(i). After this, the selected word line 20 a starts to charge the bit lines 30 a-30 z through the OTP cells 1 aa-1 az and the voltages on the bit lines 30 a-30 z begin to rise. At this time, the voltage on each bit line is sent to the amplifier 58S by rotating the column address 54A. For each column address 54A, the column decoder 54 selects a bit line (e.g. 30 b) and sends its voltage V_(b) to the input 51 of the amplifier 58S. When the value of the voltage V_(b) exceeds the threshold voltage V_(T) of the amplifier 58S, the output 55 is toggled. By measuring the toggling time, the state of each OTP cell (e.g. the OTP cell 1 ab at the intersection of the selected word line 20 a and the selected bit line 30 b) can be determined.

During the above measurement, because the V_(T) of the amplifier 58S is relatively small (˜0.1V or smaller), the voltage changes delta(V) on the bit lines 30 a-30 z are small. The largest voltage change delta(V)_(max)˜N*V_(T) is far less than the read voltage V_(R). As long as the I-V characteristics of the OTP cell satisfies I(V_(R))>>I(−N*V_(T)), the 3D-OTP_(MB) would work properly even with leaky OTP cells.

To minimize read error due to external interferences, the present invention further discloses differential amplifiers for measuring the states of the OTP cells. FIGS. 7A-C disclose a first preferred OTP array 0A with differential amplifiers. In addition to the regular bit line 30 a-30 z and OTP cells 1 aa-1 zz (FIG. 6A), the preferred OTP array 0A further comprises dummy bit lines 31 a-31 f and dummy OTP cells 1 a 0-1 z 5 (FIG. 7A). For clarity, the regular bit line 30 a-30 z are referred to as data bit lines, while the regular OTP cells 1 aa-1 zz are referred to as data OTP cells. The dummy bit lines 31 a-31 f form a dummy bit-line set 30DY, while the data bit lines 30 a-30 z form a data bit-line set 30DT.

This preferred embodiment further comprises N−1 (in this case, =3) differential amplifiers 58 a-58 c (FIG. 7A). All differential amplifiers 58 a-58 c comprise two inputs: first inputs are the bit-line voltage V_(b) from a data OTP cell (i.e. the OTP cell stores data), while second inputs are the reference voltages V_(ref,1)−V_(ref,3). For example, the reference voltage for the differential amplifier 58 a is V_(ref,1)˜(V_(‘0’)+V_(‘1’))/2; the reference voltage for the differential amplifier 58 b is V_(ref,2)˜(V_(‘1’)+V_(‘2’))/2; and, the reference voltage for the differential amplifier 58 c is V_(ref,3)˜(V_(‘2’)+V_(‘3’))/2. As used herein, V_(‘i’) (i=0, 1, 2, or 3) is the voltage on the associated bit line when reading out a state ‘I’ OTP cell.

To generate these reference voltages V_(ref,1)-V_(ref,3), the OTP array 0A uses 2N−2 (in this case, =6) dummy bit lines 31 a-31 f. Each word line (e.g. 20 a) is associated with 2N−2 (in this case, =6) dummy OTP cells (e.g. 1 a 0-1 a 5). Like the data OTP cells 1 aa-1 az, the dummy OTP cells 1 a 0-1 a 5 have N states. For example, the dummy OTP cells 1 aa 0-1 a 5 on the word line 20 a are in the states ‘0’, ‘1’, ‘1’, ‘2’, ‘2’, ‘3’, ‘3’, respectively (FIG. 7A). The reference voltage V_(ref,1) on the second input 53 a of the amplifier 58 a is generated by shorting the dummy bit lines 31 a (coupled with a ‘0’ dummy OTP cell 1 a 0) and 31 b (coupled with a ‘1’ dummy OTP cell 1 a 1). Namely, V_(ref,1)=(V_(31a)−V_(31b))/2=(V_(‘0’)+V_(‘1’))/2(FIG. 7B). The reference voltage V_(ref,2) on the second input 53 b of the amplifier 58 b is generated by shorting the dummy bit lines 31 c (coupled with a ‘1’ dummy OTP cell 1 a 2) and 31 d (coupled with a ‘2’ dummy OTP cell 1 a 3). Namely, V_(ref,2)=(V_(31c)+V_(31d))/2=(V_(‘1’)+V_(‘2’))/2. The reference voltage V_(ref,3) on the second input 53 c of the amplifier 58 c is generated by shorting the dummy bit lines 31 e (coupled with a ‘2’ dummy OTP cell 1 a 4) and 31 f (coupled with a ‘3’ dummy OTP cell 1 a 5). Namely, V_(ref,3)=(V_(31e)+V_(31f))/2=(V_(‘2’)+V_(‘3’))/2.

To determine the state of a selected data OTP cell, N−1 measurements are taken concurrently at the N−1 amplifiers 58 a-58 c. The data OTP cell is in the state ‘k’ if V_(ref,k-1)<V_(b)<V_(ref,k)(k=1, 2, . . . N−1). For example, to measure the state of the data OTP cell 1 ab, the column decoder 54 sends the voltage on the bit line 30 b to the first inputs of all amplifiers 58 a-58 c. The amplifiers 58 a-58 c make three measurements concurrently (FIG. 7C). As the data OTP cell 1 ab is in the state ‘2’, the bit-line voltage V_(b) on the first input 51 is larger than the reference voltages V_(ref,1), V_(ref,2) on the second inputs 53 a, 53 b of the amplifiers 58 a, 58 b, respectively. However, V_(b) is smaller than V_(ref,3) on the second input 53 c of the amplifier 58 c. Thus, the outputs 55 a-55 c of the amplifiers 58 a-58 c are high, high, low. Accordingly, the state of the data OTP cell 1 ab can be determined.

FIGS. 8A-8B discloses a second preferred OTP array 0A comprising an amplifier. Instead of three amplifiers 58 a-58 c (FIG. 7A), it uses a single amplifier 58D (FIG. 8A). The first input of the amplifier 58D is the bit-line voltage V_(b) of a selected data OTP cell, while the second input is a selected reference voltage V_(ref). This preferred embodiment uses fewer dummy bit lines and dummy OTP cells than FIG. 7A. The OTP array 0A comprises only N (in this case, =4) dummy bit lines 32 a-32 d and each word line (e.g. 20 a) is coupled with N dummy OTP cells (e.g. 1 a 0-1 a 3). Like the data OTP cells 1 aa-1 az, the dummy OTP cells 1 a 0-1 a 3 have N states. For example, the dummy OTP cells 1 aa 0-1 a 3 on the word line 20 a are in the states 0, 1, 2, 3, respectively (FIG. 8A). Adjacent dummy bit lines (e.g. 32 a, 32 b) are coupled through a pair of pass transistors (e.g. 56 a 1, 56 a 2), which is controlled by a control signal 56 a. An appropriate reference voltage V_(ref) is generated by asserting the corresponding control signals. For example, by asserting the control signal 56 a, the passing transistors 56 a 1, 56 a 2 is turned on and the reference voltage V_(ref) is equal to the average of the voltages on the dummy bit lines 32 a, 32 b. Namely, V_(ref)=V_(32a)+V_(32b)=(V_(‘0’)+V_(‘1’))/2.

To determine the state of a selected data OTP cell, N−1 measurements are taken sequentially at the amplifier 58D (FIG. 8B). The OTP cell is in the state ‘k’ if V_(ref,k-1)<V_(b)<V_(ref,k)(k=1, 2, . . . N−1). For example, to determine the state of the data OTP cell 1 ab, the column decoder 54 sends the voltage V_(b)(=V_(‘2’)) on the bit line 30 b to the first input 51 of the amplifier 58D. Three sequential measurements are taken. During the first measurement T₁, the control signal 56 a is asserted to turn on the pass transistors 56 a 1, 56 a 2. The reference voltage V_(ref) on the second input 53 of the amplifier 58D is V_(ref)=(V_(‘0’)+V_(‘1’))/2. As V_(ref)<V_(b), the output 55 of the amplifier 58D is high. Similarly, during the second measurement T₂, by turning on the pass transistors 56 b 1, 56 b 2, the output 55 of the amplifier 58D stays high as V_(ref)=(V_(‘1’)+V_(‘2’))/2<V_(b); during the third measurement T₃, by turning on the pass transistors 56 c 1, 56 c 2, the output 55 of the amplifier 58D turns low as V_(ref)=(V_(‘2’)+V_(‘3’))/2>V_(b). By analyzing the results from three measurements T₁-T₃, the state of the data OTP cell 1 ab can be determined.

In the preferred embodiments of FIGS. 6A-8B, the row decoder 52, the column decoder 54 and the amplifier 58S, 58 a-58 c, 58D are formed on the substrate 0 and are a portion of the substrate circuit OK. The OTP array 0A is stacked above the substrate circuit OK and covers at least a portion thereof. The 3D-OTP_(MB) has a small die size and a low die cost.

All dummy OTP cells need to be pre-programmed before shipping. During pre-programming, the resistances of the dummy OTP cells need to be adjusted precisely. For the preferred embodiment of FIG. 7A (or, FIG. 8A), each word line 20 a is associated with 2N−2 (or, N) dummy OTP cells. The total number of the dummy OTP cells in the OTP array 0A (with M data word lines and N data bit lines) is (2N−2)*M (or, N*M). Pre-programming all of them is time consuming and costly. To reduce the pre-programming time, the present invention further discloses a dummy word line. FIG. 9 discloses a preferred OTP array 0A comprising a dummy word line 20D. In addition to the data word lines 20 a, 20 b, . . . 20 y, 20 z, the preferred OTP array 0A comprises a dummy word line 20D. The pre-programming is only carried out to the dummy OTP cells 1D1-1D5 located at the intersections of the dummy word line 20D and the dummy bit lines 31 b-31 f. Because majority of the dummy OTP cells (e.g. 1 a 0-1 b 5, 1 y 0-1 z 5) do not need pre-programming, the pre-programming time is significantly reduced.

During read, both voltages on the selected data word line (e.g. 20 a) and the dummy word line 20D are raised to V_(R). Because the dummy OTP cells 1Da-1Dz at the intersections of the dummy word line 20D and the data bit lines 30 a-30 z are unprogrammed, the voltage rise on the dummy word line 20D would not affect the signals on the data bit lines 30 a-30 z. Moreover, because the dummy OTP cells 1 a 0-1 a 5 at the intersections of the data word line 20 a and the dummy bit lines 31 a-31 f are unprogrammed, the voltage rise on the data word line 20 a would not affect the signals on the dummy bit lines 31 a-31 f, either. Accordingly, the operation of this preferred embodiment is similar to those in FIGS. 7A-8B.

In the preferred embodiments of FIGS. 7A-9, to manufacture high-quality dummy OTP cells, all dummy word line, dummy bit lines and dummy OTP cells are preferably formed in the middle of the OTP array 0A. For example, the dummy word line 20D is formed in the middle of the OTP array 0A, so are the dummy bit lines 31 a-31 f.

Although examples disclosed in these figures are horizontal 3D-OTP (i.e. the OTP memory levels 100, 200 are horizontal), the inventive spirit can be extended to vertical 3D-OTP (i.e. the OTP memory strings are vertical to the substrate). More details of the vertical 3D-OTP are disclosed in Chinese Patent Application No. 201610234999.5, filed on Apr. 16, 2017.

While illustrative embodiments have been shown and described, it would be apparent to those skilled in the art that many more modifications than that have been mentioned above are possible without departing from the inventive concepts set forth therein. For example, beside N=4 (i.e. each OTP cell stores two bits), the present invention can be extended to N=8 (i.e. each OTP cell stores three bits) or more. The invention, therefore, is not to be limited except in the spirit of the appended claims. 

What is claimed is:
 1. A multi-bit-per-cell three-dimensional read-only memory (3D-OTP_(MB)), comprising: a semiconductor substrate including transistors thereon; an OTP array stacked above said semiconductor substrate, said OTP array comprising a plurality of OTP cells including a first unprogrammed dummy OTP cell, second and third programmed dummy OTP cells, each of said OTP cells comprising an antifuse layer, wherein said second and third programmed dummy OTP cells have different states; a first dummy bit line associated with said first unprogrammed dummy OTP cell; a second dummy bit line associated with said second programmed dummy OTP cell; a third dummy bit line associated with said third programmed dummy OTP cell; a plurality of contact vias coupling said OTP cells to said semiconductor substrate; a differential amplifier with an input disposed on said semiconductor substrate, wherein said input is coupled with said first and second dummy bit lines during a first measurement, and said input is coupled with said second and third dummy bit lines during a second measurement; wherein said OTP cells have more than two states, the OTP cell in different states being programmed by different programming currents.
 2. The 3D-OTP_(MB) according to claim 1, wherein said OTP cells are programmed by at least two programming currents.
 3. The 3D-OTP_(MB) according to claim 1, wherein said second dummy OTP cell has a larger resistance than said third dummy OTP cell.
 4. The 3D-OTP_(MB) according to claim 1, wherein the programming current of said second dummy OTP cell is smaller than the programming current of said third dummy OTP cell.
 5. A multi-bit-per-cell three-dimensional read-only memory (3D-OTP_(MB)), comprising: a semiconductor substrate including transistors thereon; an OTP array stacked above said semiconductor substrate, said OTP array comprising a plurality of OTP cells including a data OTP cell, a plurality of word lines including a data word line, and a plurality of bit lines including a data bit line, each of said OTP cells comprising an antifuse layer; a dummy word line in parallel with said data word line; a dummy bit line in parallel with said data bit line; a first dummy OTP cell formed at the intersection of said dummy word line and said dummy bit line, wherein said first dummy OTP cell is programmed; a second dummy OTP cell formed at the intersection of said data word line and said dummy bit line, wherein said second dummy OTP cell is unprogrammed; a plurality of contact vias coupling said OTP cells to said semiconductor substrate; wherein said OTP cells have N states with N>2, the OTP cell in different states being programmed by different programming currents.
 6. The 3D-OTP_(MB) according to claim 5, wherein said OTP array comprises 2N−2 dummy bit lines.
 7. The 3D-OTP_(MB) according to claim 5, wherein said OTP array comprises N dummy bit lines.
 8. The 3D-OTP_(MB) according to claim 5, further comprising a third dummy OTP cell formed at the intersection of said dummy word line and said data bit line, wherein said third dummy OTP cell is unprogrammed. 